Comparison of Performance of Classification Algorithms Using Standard Deviation-based Feature Selection in Cyber Attack Datasets
نویسندگان
چکیده
Supervised machine learning techniques are commonly used in many areas like finance, education, healthcare, engineering, etc. because of their ability to learn from past data. However, such can be very slow if the dataset is high-dimensional, and also irrelevant features may reduce classification success. Therefore, feature selection or reduction overcome mentioned issues. On other hand, information security for both people networks crucial, it must secured without wasting time. Hence, approaches that make algorithms faster reducing success needed. In this study, we compare run-time performance state-of-the-art using standard deviation-based aspect datasets. For purpose, applied KDD Cup 99 Phishing Legitimate datasets selecting most relevant features, then run selected on results. According obtained results, while all satisfying Decision Tree (DT) was best one among others. Tree, k Nearest Neighbors, Naïve Bayes (BN) were sufficiently fast, Support Vector Machine (SVM) Artificial Neural Networks (ANN NN) too slow.
منابع مشابه
Fast SFFS-Based Algorithm for Feature Selection in Biomedical Datasets
Biomedical datasets usually include a large number of features relative to the number of samples. However, some data dimensions may be less relevant or even irrelevant to the output class. Selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. To this end, this paper presents a hybrid method of filter and wr...
متن کاملFeature-based Malicious URL and Attack Type Detection Using Multi-class Classification
Nowadays, malicious URLs are the common threat to the businesses, social networks, net-banking etc. Existing approaches have focused on binary detection i.e. either the URL is malicious or benign. Very few literature is found which focused on the detection of malicious URLs and their attack types. Hence, it becomes necessary to know the attack type and adopt an effective countermeasure. This pa...
متن کاملfast sffs-based algorithm for feature selection in biomedical datasets
biomedical datasets usually include a large number of features relative to the number of samples. however, some data dimensions may be less relevant or even irrelevant to the output class. selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. to this end, this paper presents a hybrid method of filter and wr...
متن کاملComparison of Performance in Image Classification Algorithms of Satellite in Detection of Sarakhs Sandy zones
Extended abstract 1- Introduction Wind erosion as an “environmental threat” has caused serious problems in the world. Identifying and evaluating areas affected by wind erosion can be an important tool for managers and planners in the sustainable development of different areas. nowadays there are various methods in the world for zoning lands affected by wind erosion. One of the most important...
متن کاملAn ensemble approach for feature selection of Cyber Attack Dataset
Abstract— Feature selection is an indispensable pre-processing step when mining huge datasets that can significantly improve the overall system performance. Therefore in this paper we focus on a hybrid approach of feature selection. This method falls into two phases. The filter phase select the features with highest information gain and guides the initialization of search process for wrapper ph...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International journal of pure and applied sciences
سال: 2023
ISSN: ['2149-0910']
DOI: https://doi.org/10.29132/ijpas.1278880